Structured optical receivers to attain superadditive capacity and the Holevo limit 
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When classical information is sent over a quantum channel, attaining the ultimate limit to channel 
capacity requires the receiver to make joint measurements over long codeword blocks. For a pure- 
state channel, we construct a receiver that can attain the ultimate capacity by applying a single-shot 
unitary transformation on the received quantum codeword followed by simultaneous (but separable) 
projective measurements on the single-modulation-symbol state spaces. We study the ultimate limits 
of photon-information-efficient communications on a lossy bosonic channel. Based on our general 
results for the pure-state quantum channel, we show some of the first concrete examples of codes and 
structured joint-detection optical receivers that can achieve fundamentally higher (superadditive) 
channel capacity than conventional receivers that detect each modulation symbol individually. 



When the modulation alphabet of a communication 
channel are quantum states, the Holevo limit is an upper 
bound to the Shannon capacity of the physical channel 
paired with any receiver measurement. Even though the 
Holevo limit is an achievable capacity, the receiver in 
general must make joint {collective) measurements over 
long codeword blocks — measurements that can't be real- 
ized by detecting single modulation symbols followed by 
classical post processing. This phenomenon of a joint- 
detection receiver (JDR) being able to yield higher ca- 
pacity than any single-symbol receiver measurement, is 
often termed as superadditivity of capacity. 

For the lossy bosonic channel, a coherent-state modu- 
lation suffices to attain the Holevo capacity, i.e., non- 
classical transmitted states do not yield any addi- 
tional capacity |1]. Hausladen et. al.'s square-root- 
measurement (SRM) which in general is a positive 
operator- valued measure (POVM), applied to a random 
code gives us the mathematical construct of a receiver 
that can achieve the Holevo limit. Lloyd et. al. [3 re- 
cently showed a receiver that can attain the Holevo ca- 
pacity of any quantum channel by making a sequence 
of "yes/no" projective measurements on a random code- 
book. Sasaki et. al. [4], in a series of papers, showed sev- 
eral examples of superadditive capacity using pure-state 
alphabets and the SRM. However, the key practical ques- 
tions that remain unanswered are how to design modula- 
tion formats, channel codes, and most importantly, struc- 
tured realizations of Holevo-capacity- approaching JDRs. 

In this paper, (i) we show that the Holevo limit of a 
pure-state channel is attained by a projective measure- 
ment, which can be implemented by a unitary operation 
on the quantum codeword followed by separable projec- 
tive measurements on the single-modulation-symbol sub- 
spaces, (ii) we translate our result into an optimal re- 
ceiver for the lossy bosonic channel, and (iii) we show 
concrete examples of codes and receivers that yield su- 
peradditive capacity for optical binary-phase-shift keying 
(BPSK) signaling at low photon numbers. These, we be- 
lieve, are the first receiver realizations that can exhibit 
superaddivity, and can be tested using laboratory optics. 



Attaining Holevo limit of a pure-state channel. 

We encode classical information using a Q-ary modu- 
lation alphabet of non-orthogonal pure-state symbols in 
= ),..., |?/;q)}. Each channel use constitutes send- 
ing one symbol. We assume that the channel preserves 
the purity of thus taking the states {iV^g)} to be those 
at the receiver. The only source of noise is the physical 
detection of the states. Assume that the receiver detects 
each symbol one at a time. Channel capacity is given by 
the maximum of the single-symbol mutual information, 

Ci = max max Ii ({pi} , |n[-^^ \] bits/symbol, (1) 

where the maximum is taken over priors {pi} over the al- 
phabet and a set of POVM operators jn^^^j, l<j<J 
on the single- symbol state- space. The measurement of 
each symbol produces one of J possible outcomes, with 
conditional probability P{j\i) = (V^i|n^-^^ To achieve 
reliable communications at a rate close to Ci, forward 
error-correction will need to be applied on the discrete 
memoryless channel with transition probabilities P{j\i). 
In other words, for any rate R < Ci, there exists a se- 
quence of codebooks Cn with K = 2^^ codewords |c/e), 
1 < k < each codeword being an n-symbol tensor 
product of states in and a decoding rule, such that 
the average probability of decoding error (guessing the 

wrong codeword), Pe^^ ~ ^ ~ ^ Xl^i = /c) ^ 0, 
as n ^ oo. In this 'Shannon' setting, optimal decod- 
ing is a maximum likelihood (ML) decision, which can in 
principle be pre-computed as a table lookup (see Fig.[T]). 
We define Cn as the maximum capacity achievable with 
measurements that jointly detect up to n symbols. The 
fact that joint detection allows for C^+m > Cn-\-Cm^ (or 
Cn > Ci) is referred to as superadditivity of capacity. The 
Holevo- Schumacher- Westmorland (HSW) theorem says, 

Coo = maxS ly2p^\^^^){H] = lim C,, (2) 
{p^} J 

the Holevo bound, is the ultimate capacity limit, where 
S{p) = — Trplog2P is the von Neumann entropy, and 
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FIG. 1: Classical communication system, shown here for 
a BPSK alphabet. If the receiver uses symbol- by-symbol 
detection, maximum capacity = Ci bits/symbol. If the 
Detection+Demodulation block is replaced by a general n- 
input n-output quantum measurement, maximum capacity 
= Cn bits/symbol. Superadditivity: Coo > Cn > Ci, where 
Coo is the Holevo limit. The joint-detection structure shown 
achieves Holevo limit for BPSK over a lossy bosonic channel. 



that Coo is achievable with joint detection over long code- 
word blocks. Calculating Coo however, doesn't require 
the knowledge of the optimal receiver measurement. In 
other words, if we replaced the detection and demodu- 
lation stages in Fig. [l] by one giant quantum measure- 
ment, then for any rate R < Coo-, there exists a se- 
quence of codebooks Cn with K = 2^^ codewords |c/c), 
1 < k < and an n-input n-output POVM over 
the n-symbol state-space jn^^^j, 1 < k < such 

~ (n) 

that the average probability of decoding error, Pe = 
1 - :^ Y^k=ii^k\i^^k''\ck) ^ 0, as n ^ oo. 

Theorem 1: For a pure-state channel, a projective mea- 
surement can attain Coo, and can be implemented as a 
unitary transformation on the codeword followed by a 
parallel set of separable single-symbol measurements. 

Proof: Consider a codebook C with K = 2^^ code- 
words |c/c), 1 < k < each codeword being an n-symbol 
tensor product of states in S. The SRM (which in gen- 
eral is a POVM) on a random codebook can achieve the 
Holevo capacity [2 . However, Helstrom showed that the 
minimum probability of error (MPE) measurement for 
discriminating K pure states with the least average prob- 
ability of error is a i^-element projective measurement on 
the span of those K states 0. By definition, the MPE 
measurement on C must achieve a lower average probabil- 
ity of error in discriminating the codewords in C than the 
SRM. Hence, the MPE measurement is capacity achiev- 
ing. In other words, given any reliable communication 



10 
9 
8 

o 

6 

2 4 
3 
2 



- ■ ■ ' ■ ' ' ■■ ■ 




* Ultimate Holevo limit 




BPSK Holevo limit 




- BPSK + Dolinar Receiver 




--- BPSK (2,3,1) code + n=2 JDR 




BPSK (2-1 ,2',2'"^) Hadamard code + JDR 




















] (10 bits per photon) \ 

1 n* = 2.66 X 10 ^ ^^N. ^ 





FIG. 2: Photon information efficiency (bits per received pho- 
ton) as a function of mean photon number per mode, n. 



threshold on the decoding error rate, Pth, there exists a 
codebook C of a long enough length n, such that the MPE 

measurement on C described by the projectors In^p^j 
with H 



(n) 

MPE,/c 



\wk){wk\ (where {\wi), . . . , \wk)} form 
a complete ortho- normal (CON) basis for span(C)), can 
attain a probability of decoding error Pe^^ = 1 — 
{^/K)Ek=i\{^k\wk)\^ < Pth. Now, let us define the 



th- 

MPE measurement on the states in 6* as H 



(1) 

MPE 



{\mq){mq\}, where {|mi), . . . , Img)} is a CON basis of 
the single-symbol state space Hi = span(6'). Let us de- 
fine the Kronecker-product CON basis of the n-symbol 
state space Hn = ^, M = {|mi), . . . , \mQn)}^ where 

V/c. Let us define another CON basis of the n- 
symbol state space Hn by extending . . . , \ wk)} — 

the vectors describing the MPE measurement on C — 
as W = {\wi),. . . ,\wk),\wk+i),' " ,\wQr.)}. Fi- 
nally, let us extend the codebook C into the set of all 
Qn length-n sequences of modulation symbols = 
{|ci), . . . , \ck), |ck+i), . . . , |cq-)} , and express each \ck) 
in both CON bases M and VK, \ck) = Ylf=i{'^jWk)\'f^j) , 
|c/c) = Yl%i{'^j\ck)\wj), where {mj\ck) = {UM)kj and 
{wj\ck) = {Uw)kj are {k^jY^ elements of the unitary 
matrices Um and Uw The bases M and W are uni- 
tarily equivalent. Thus, the MPE measurement on C 
is equivalent to first applying a unitary U on the code- 
word followed by the projective measurement described 
by M, which is essentially n parallel (and separable) 

MPE measurements IAmpe}' single- symbol sub- 

spaces. The unitary U is given by (in the basis M), 



U = '^'^u^jj,\mj){mk\; Ujk 

k=ij=i 



M 



jk ' 



(3) 



Superadditive optical receivers. Consider a single- 
mode lossy bosonic channel (such as a far-field single- 
spatial- mode free-space-optical (FSO) channel), where 
data is modulated using a succession of pulses (orthogo- 
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nal temporal modes) with mean received photon num- 
ber n per mode, where each temporal mode (pulse) 
carries one modulation symbol. The Holevo capacity 
is given by, Cuit(n) = g{n) = (1 + n)log2(l + n) - 
nlog2 n bits/symbol, which is attained using a coherent- 
state modulation, i.e., non-classical modulation states 
can't get any higher capacity pLj. Since pure loss pre- 
serves coherent states (with linear amplitude attenua- 
tion), it suffices to define capacity as a function of the 
mean photon number per received mode n, and the results 
derived above for a pure-state channel applies. Achiev- 
ing the Holevo limit requires an optimal codebook and 
joint measurement on long codeword blocks. At high 
n, symbol-by-symbol heterodyne detection asymptoti- 
cally achieves C^\t{n). The low photon number (n <C 1) 
regime is more interesting for far- field FSO communica- 
tions, where the joint-detection gain is most pronounced. 

In Fig. [2j we show the photon information efficiency 
(PIE), the number of bits that can be reliably decoded 
per received photon, as a function of n [12 . There 
is no fundamental upper bound to the PIE; however, 
higher PIE necessitates lower n. Furthermore, binary 
modulation and coding is sufficient to meet the Holevo 
limit at low n. Specifically, the binary-phase-shift key- 
ing (BPSK) alphabet Si = {\a),\ - a)}, |ap = n, is the 
Holevo-optimal binary modulation at n <C 1. Dolinar 
proposed a structured receiver that realizes the binary 
MPE projective measurement on an a pair of coherent 
states using single photon detection and coherent opti- 
cal feedback [5]. If the Dolinar receiver (DR) is used to 
detect each symbol, the BPSK channel is reduced to a 
classical binary symmetric channel (BSC) with capacity 
Ci = 1 - H{q) bits/symbol, ^ = [1 - Vl - e-4^]/2. This 
is the maximum achievable capacity when the receiver 
detects each symbol individually, which includes all con- 
ventional (direct-detection and coherent-detection) re- 
ceivers. The PIE Ci{n)/n caps out at 2/ In 2 ^ 2.89 
bits/photon at n ^ 1. Closed- form expressions and scal- 
ing behavior of Cn, the maximum capacity achievable 
with measurements that jointly detect up to n symbols, 
for n > 2 are not known. However, the Holevo limit of 
BPSK, Coo(n) = i^([l+e-2^]/2), can be calculated using 
Eq. ([2|. Good codes and JDRs will be needed to bridge 
the huge gap between the PIEs Ci{n)/n and Coo{n)/n^ 
shown in Fig. |2] It is interesting to reflect on the point 
shown by the orange circle (at 10 bits/photon) in Fig. |2j 
which says that for a 1.55/im far- field FSO system operat- 
ing at 1 GHz modulation bandwidth, the laws of physics 
permit reliable communications at 0.266 Gbps with only 
3.4 pW of average (and peak) received optical power! 

A two-symbol superadditive JDR — Some examples 
of superadditive codes and joint measurements have 
been reported [H [6^, but no structured receiver de- 
signs. An ensemble (a (2,3,1) inner code [13 ) con- 
taining three of the four 2-symbol BPSK states, ^2 = 
|a)| — a), I — with priors (1 — 2p,p,p), 
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FIG. 3: A two-symbol JDR that attains ~ 2.5% higher capac- 
ity for BPSK than the best single-symbol (Dolinar) receiver. 



< p < 0.5, can attain, with the best 3-element pro- 
jective measurement in span(S2), up to ^ 2.8% higher 
capacity that Ci [6]. Since this is a Shannon capacity 
result, a classical outer code with codewords comprising 
of sequences of states from 5*2 will be needed to achieve 
this capacity I2 > Ci. Using the MPE measurement on 
5*2 (which can be analytically calculated ^, unlike the 
numerically optimized projections in |6]), /2/C1 ^ 1.0266 
can be obtained. A receiver that involves a unitary op- 
eration on the [2,3,1] code (a beamsplitter) followed by 
two separable single-symbol measurements (in this case, 
a single-photon detector (SPD), and a DR) (see Fig. [3|, 
can attain h/Ci ^ 1.0249 (see Fig. [2|. Its likely that 
none of these projective measurements on S2 attain C2, 
since the single-shot measurement that maximizes the ac- 
cessible information in S2 could in general be a 6-element 
POVM [HI. However, Theorem 1 proves that as the size 
of the inner code n ^ 00, a projective measurement on 
the codebook that involves an n-mode unitary followed 
by a DR-array is capable of attaining Coo, without any 
additional outer code (see Fig. [T]). 

An n-symbol superadditive JDR — A (2^ - 1,2^,2^"-^) 
BPSK Hadamard code, with n-mean-photons BPSK 
symbols, has the same geometry (mutual inner-products) 
and thus is unitarily equivalent to the (2^, 2^, 2^"-^) pulse- 
position-modulation (PPM) code with 2^ n- mean-photon 
PPM pulses. The former is slightly space- efficient, since 
it achieves the same equidistant distance profile, but with 
one less symbol. Consider a BPSK Hadamard code de- 
tected by a 2^-mode unitary transformation (with one 
ancilla mode prepared locally at the receiver, in the \a) 
state) built using (nlog2n)/2 50-50 beamsplitters ar- 
ranged in a fast-fourier-transform (EFT) butterfly cir- 
cuit, followed by a separable n = 2^-element SPD-array, 
as shown (for n = 8) in Fig. [4ja). The beamsplitter 
array is reminiscent of the fast Walsh Hadamard trans- 
form, if we recall that each 50-50 beamsplitter imple- 
ments an order-2 EFT. The beamsplitters 'unravel' the 
BPSK codebook into a PPM codebook, separating out 
the photon energies in spatially separated bins, making 
possible discriminating the codewords using a SPD array. 
This receiver design is a more 'natural' choice for spatial 
modulation, across, say orthogonal spatial modes of a 
near-field FSO channel. The ancilla mode at the receiver 
necessitates a local oscillator phase locked to the received 
pulses, which is hard to implement. Since the number 
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FIG. 4: (a) The BPSK (7,8,4) Hadamard code is unitarily 
equivalent to the (8,8,4) pulse-position- modulation (PPM) 
code via a Walsh transform built using twelve 50-50 beam- 
splitters, (b) Bit error rate plotted as a function of n. 



of ancilla modes doesn't scale with the size of the code, 
we can append the ancilla mode to the transmitted code- 
word, so that the received ancilla can serve as a pilot tone 
for our interferometric receiver. The Shannon capacity 
of this code- JDR pair — allowing coding over the erasure 
outcome (no clicks registered at any SPD element) — is 
^n(^) = (log2 i^/i^)(l — exp(— 2(in)) bits/symbol. In 
Fig. |2j we plot the envelope, max^/nC^)/'^ (the green 
dotted plot), as a function of n. This JDR not only at- 
tains a much higher superadditive gain than the n = 2 
JDR we describe above, it doesn't need phase tracking 
and coherent optical feedback like the DR. Note that n- 
ary PPM signaling also achieves the capacity /n(^) with 
a SPD receiver, albeit with a much higher (x2^) peak 
power as compared to BPSK. However, Theorem 1 says 
that the receiver construct shown in Fig. [l] is capable of 
bridging the rest of the gap to the Holevo limit (i.e., the 
blue plot in Fig.|2| in conjunction with an optimal BPSK 
code. Theorem 1 applies readily to any higher-order mod- 
ulation format (Q > 2, required to achieve capacity at 
higher n), where the DR in the JDR structure must be 
replaced by an extension of the DR that discriminates Q 
modulation symbols at their MPE limit (not known yet). 
Finally, note that in the proof of Theorem 7, the MPE 
measurement on the single-symbol space Hmpe^ was just 
a convenient choice; any other projective measurement 
would've worked. In the optimal JDR for BPSK there- 
fore, the DRs can be replaced by an array of Kennedy re- 
ceivers (which applies a coherent shift by —a followed by 



SPD — a lot simpler than the DR), since it also performs 
a projective measurement on span(6'i). In Fig. [4jb), we 
plot the bit error rates Ph{E) as a function of n for un- 
coded BPSK, and the [255,256,128] BPSK Hadamard 
code, when detected using a symbol- by-symbol DR and 
our structured JDR, respectively. The coding gain now 
has two components, a (classical) coding gain, and an 
additional joint- detection gain. 

A great deal is known about binary codes that achieve 
low bit error rates on the BSC at n very close to the 
Shannon limit [9 . It would be interesting to see how 
close to the Holevo limit can these same codes perform, 
when paired with their respective quantum MPE mea- 
surements. It will be useful to design codes with symme- 
tries that allow them to approach Holevo capacity, with 
the unitary U of the JDR in Fig. [T] realizable via a sim- 
ple network of beamsplitters, phase shifters, two- mode 
squeezers, and Kerr non-linearities (which form a uni- 
versal set for realizing an arbitrary multimode bosonic 
unitary ^2) along with a low-complexity outer code, if 
at all. The fields of information and coding theory have 
had a unique history. Even though many of its ultimate 
limits were determined in Shannon's founding paper [Hj, 
it took generations of magnificent coding theory research, 
to ultimately find practical capacity-approaching codes. 
Even though realizing reliable communications on an op- 
tical channel close to the Holevo limit might take a while, 
it certainly does seem to be in the visible horizon. 
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